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Abstract. The quotient complexity, also known as state complexity, of 
a regular language is the number of distinct left quotients of the lan- 
guage. The quotient complexity of an operation is the maximal quotient 
complexity of the language resulting from the operation, as a function 
of the quotient complexities of the operands. The class of star-free lan- 
guages is the smallest class containing the finite languages and closed 
under boolean operations and concatenation. We prove that the tight 
bounds on the quotient complexities of union, intersection, difference, 
symmetric difference, concatenation, and star for star-free languages are 
the same as those for regular languages, with some small exceptions, 
whereas the bound for reversal is 2" — 1. 

Keyvirords: aperiodic, automaton, complexity, language, operation, quo- 
tient, regular, star-free, state complexity 



1 Introduction 

The class of regular languages can be defined as the smallest class containing 
the finite languages and closed under union, concatenation and star. Since reg- 
ular languages are also closed under complementation, one can redefine them 
as the smallest class containing the finite languages and closed under boolean 
operations, concatenation and star. In this new formulation, a natural question 
is that of the generalized star height of a regular language, which is the minimum 
number of nested stars required to define the language when boolean operations 
are allowed. It is not clear who first considered the problem of generalized star 
height, but McNaughton and Papert reported in their 1971 monograph [13] that 
this problem had been open "for many years" . There exist regular languages of 
star height and 1, but it is not even known whether there exists a language of 
star height 2. See |http://liaf a. jussieu.fr/~jep/Problemes/starheight .html | . 

Wc consider regular languages of star height 0, which are also called star- free. 
In 1965, Schiitzenbcrgcr proved [16] that a language is star-free if and only if its 
syntactic monoid is group-free, that is, has only trivial subgroups. An equivalent 
condition is that the minimal deterministic automaton of a star-free language is 
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permutation-free, that is, has only trivial permutations. Another point of view is 
that these automata are counter-free, since they cannot count modulo any integer 
greater than 1. They can, however, count to a threshold, that is 1, 2, ... n — 1, n 
or more. Such automata are called aperiodic, and this is the term that we use. 

The state complexity of a regular language |17| is the number of states in 
the minimal deterministic finite automaton accepting that language. We prefer 
the equivalent concept of quotient complexity [2| , which is the number of distinct 
left quotients of the language, because quotient complexity has some advantages. 
The quotient complexity of an operation in a subclass of regular languages is the 
maximal quotient complexity of the language resulting from the operation, as a 
function of the quotient complexities of the operands when they range over all 
the languages in the subclass. The complexities of basic operations in the class of 
regular languages were studied by Maslov [T3] and Yu, Zhuang and Salomaa [15] . 

The complexities of operations were also considered in several subclasses of 
regular languages: unary |15|18| . finite |7|17j . ideal [l], closed [6], prefix- free [TT] . 
suffix- free [T^, bifix-, factor-, and subword-free [5], and convex [5]. The complex- 
ity of operations can be significantly lower in a subclass of regular languages than 
in the general case. We prove that this is not the case for star-free languages, 
which meet the bounds for regular languages, with small exceptions. 

It was shown in [I] that the tight bound for converting an n-state aperiodic 
nondcterministic automaton to a deterministic one is 2". 

In Section [5] we define our terminology and notation. Boolean operations, 
concatenation, star, and reversal are studied in Sections |3HS1 respectively. Unary 
languages are treated in Section [3 and Section |5] concludes the paper. 

2 Terminology and Notation 

If is a finite non-empty alphabet, then S* is the set of all words over this 
alphabet, with e as the empty word. For w G S*, a & S, let \w\ be the length 
of w, and \w\a, the number of a's in w. A language is any subset of S* . 

We use the following set operations on languages: complement (L = S* \L), 
union (KUL), intersection (KClL), difference {K\L), and symmetric difference 
[K © L). We also use product, also called (con) catenation [KL = {w € S* \ 
w = uv,u G K,v £ L}) and star {K* = Ui>o^*)- '^^^ reverse w'^ of a word 
w G S* is defined by: = e, and (wa)^ — aw^ . The reverse of a language L 
is = {w^ I w e L}. 

Regular languages are the smallest class of languages containing the finite 
languages and closed under boolean operations, product and star. Star-free lan- 
guages are the languages one can construct from finite languages using only 
boolean operations and concatenation. Some examples of star-free languages are 
0, S* 0, 6* = I]*aS* = 0a0 over S = {a, b}, and aa* = e over S = {a}. We 
do not write such expressions for star-free languages, but denote them as usual. 

The (left) quotient of a language L by a word w is defined as = {x G 
S* I wx e L}. The number of distinct quotients of a language is called its 
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quotient complexity and is denoted by k{L). A quotient is accepting if e £ i^,; 
otherwise it is rejecting. 

A deterministic finite automaton (DFA) is a quintuple V = (Q, IJ,d,qo, F), 
where Q is a finite set of states, 17 is a finite alphabet, S : Q x S ^ Q is the 
transition function, qq is the initial state, and C Q is the set of final or 
accepting states. As usual, the transition function is extended to Q x S* . A DFA 
T) accepts w G S* if S{qQ,w) G F, and the language accepted by 2? is L{T>). 
The language of a state q of 2? is the language ig accepted by the automaton 
{Q, S, 6, q, F). If the language of a state is empty, that state is empty. 

Let L'^ ~ e ii e € L, and L'^ = 0, otherwise. The quotient automaton of a 
regular language LisV — {Q, S, 5, qo, F), where Q — {i^, \ w G U*}, 5{Lyj, a) = 
Lwa, qa = Le ^ L, F ^ {L^ \ = e}, and = (L.^,)^. Since this is the 
minimal DFA accepting L, the quotient complexity of L is equal to the state 
complexity of L, and we call it simply complexity. 

A transformation of a set 5 = {1, . . . , n} into itself is a mapping 



where ik & S for 1 ^ fc ^ n. Each word in i7* performs a transformation of the 
set Q of states of a DFA P. A DFA is aperiodic if no word performs a permuta- 
tion, other than the identity permutation, of a subset of Q. Since testing if a DFA 
is aperiodic is PSPACE-complete [5], we use a subclass of aperiodic automata. 
Without loss of generality, we assume that Q = {1, . . . ,n}. A transformation is 
non- decreasing if j < k implies ij ^ ife. A non-decreasing transformation cannot 
have a non-trivial permutation, and the composition of non-decreasing transfor- 
mations is non-decreasing. Hence a DFA with non-decreasing input transforma- 
tions is aperiodic. 

A nondeterministic finite automaton (NFA) is defined as a quintuple J\f = 
{Q,S,ri,I,F), where Q, S, and F are as in a DFA, 77 : Q x — !> 2*5 is the 
transition function and J C Q is the set of initial .states. If rj also allows e, that 
is, 77 : Q X (r U {£}) ^ 2'3, we cah J\f an e-TVFyl. 

3 Boolean Operations 

We now consider the quotient complexity of union, intersection, symmetric dif- 
ference, and difference in the class of star-free languages. The upper bound for 
these four operations in the class of regular languages is mn |2I13I18| . 

Theorem 1. For each of the operations union, intersection, .symmetric differ- 
ence, and difference, there exist binary star-free languages K and L with quotient 
complexities m ^ 1 and n ^ 1, respectively, that meet the bound mn. 

Proof. Let S = {a, b}. We examine union first. For m = 1, let A' = and let L be 
any binary star-free language with k{L) = n. Then k{K U L) = k{L) = n = mn. 
Similarly, if ri = 1, let L = and let K be any binary star- free language with 
k{K) — m. Then k(K U L) — mn. 
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Fig. 2. Quotient automaton of K U L. 



For m,n ^ 2, let K = {b*a)"'-'^b* ^ {w £ S* \ \w\a ^ m - 2}, and 
L = {a*b)"'-'^a* = {w € S* \ \w\b = n - 2}; then k{K) = m and k{L) = n, and 
both K and L are star- free. The quotient automata of K and L are in Fig. [T] for 
m = 4 and n = 5, and their direct product for K U in Fig. [2j 

Let A/ = ii' U i, and consider the quotients of M by the mn words a^V , 
i = 0, . . . , m — 1, and j = 0, . . . , n — 1; these quotients Mg^n^j correspond to 
states (i + l,j + 1) in the direct-product automaton for M. We begin with the 
rejecting quotients of M . First, Mam -1^,1-1 = 0, and all the other quotients are 
non-empty. Next, if i < m — 2 and j < n~2 (rows 1 to m — 2, columns 1 to n — 2), 
then the pair (a™^^^*, fo"^^^-*) of non-empty words belongs to M^ibi and to no 
other rejecting quotient. If i < m — 2, then M^if,n-i (rows 1 to m — 2, column n) 
contains a™~^~*, but has no words from b* . If j < n — 2, then Mg^m-ifyj (row m, 
columns 1 to n — 2) contains fe"^^^^ , but has no words from a* . So all rejecting 
quotients are distinct. 

Now turn to the accepting quotients. For i, fc ^ m — 2, quotient Mg^ibn~2 (rows 
1 to TO — 1, column n — 1) contains 6a™~^~*, and this word is not contained in 
any other quotient with k ^ i, and M^m-if,n-2 has no words from 6a*. 

Thus all the quotients in column n — 1 are distinct. For j, £ ^ n — 2, Mg^m-^^j 
(row TO— 1, columns \ io n~ 1) contains ab"~'^~^ , and this word is not contained 
in any other quotient M„m-2^f with £ 7^ j, and Mam-2i,n.-i has no words from 
ab* . Thus all the quotients in row m — 1 are distinct. Excluding M^m-2bn-2^ 
each quotient in column n — 1 contains a but not b, each quotient in row to — 1 
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contains b but not a, and Mi^m-2f,n-2 contains both a and b. Hence all accepting 
quotients arc distinct, and our claim holds for union. 

For difference, we can use K and L, where K and L meet the bound mn for 
union, because k(K \L) = k(K n L) = k(K n L) = k{K U L). 

For intersection, it was shown in [4] that the languages K = {b*a)"^^^ S* = 
{w G E* \ \w\a ^ m - 1} and L = {a*b)"-'^S* = {w € S* \ \w\b ^ n - 1} meet 
the bound mn. Since both languages are star-free, our claim holds for intersec- 
tion. These languages also meet the bound mn for symmetric difference [4]. □ 

4 Product 

The tight bound for product of regular languages |13|18j is (to — 1)2" + 2"^^. We 
show that this bound can be met by star-free languages, with some exceptions. 

In subset constructions, we use the notation S T to mean that subset S 
under input word w moves to subset T. 

Theorem 2. There exist quaternary star- free languages K and L with quotient 
complexities m ^ 1 andn ^ 3, respectively, such thatK(KL) ~ (to — l)2"-|-2"^^. 

Proof. Let the quotient automaton for K be Vk ~ {Qk, E,SK,qoj Fk), where 
Qk = {qi,q2, ■ ■ ■ ,gm}, S = {a,b,c,d}, go = qi, Fk = {<?m}, and 



Next, let the quotient automaton for L be 2?l = (Ql, E, SljPq, Fl), where 
Ql = {1, 2, . . . , n}, S = {a, b, c, d}, po = 1, Fl = {n - 1}, and 

S^li, c) = i + 1 for i = 1, . . . , n — 1, (5i(n, c) = n, 

Shih d) = j — 1 for i = 2, . . . , n, d) = 1, 

5l(i, a) = i + 1 for i = 2, . . . , n — 1, (5^(1,0) ==1, (5l(«, a) = n, 

(5^(1, b) ~ i for i = 1, . . . ,n. 

The automaton for jti = 4 is shown in Fig. [31 where the transition labeled e 
should be ignored for now. The automaton 2?^ for n = 5 is also shown in Fig. [31 
If the transition labeled e is taken into account and q^ is made a rejecting state, 
then we have an e-NFA for KL. Here the initial state is qi, the set of accepting 
states is {4}, and the transitions are as shown. 

For ls^Sk^n-1, S = {si, . . . , Sk}, si < S2 < ■ ■ ■ < Sk, Si e Ql, and 
^ a; ^ ri — Sfc, denote (si + x, . . . , Sk + x) by S+x- Similarly, for 2 ^ si ^ n, 
and ^ a; ^ si — 1, denote (si — x, . . . , — x) by S-x. 

We first show by induction on the size of S that all (to — 1)2"~^ subsets of 
the form {qi} U 5, where qi e Qk, qi 7^ q-m, and S C \ {!}, are reachable. 



SK{qi,a) 
Sxiqijb) 
6K{qi,c) 
5Kiqi,d) 



q.,+i for i = 1, . . . , m - 1, Sxiqm, a) = q, 
(7,-1 for i = 2, . . . ,m, (5^ (gi , 6) = (?i , 
q.; for i = 1, . . . , TO, 
for i 1, . . . ,TO. 
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Fig. 3. e-NFA M of KL. 

When S* = 0, the set {qi] is reached by a'~^, for i = l,...,m — 1. Now 
suppose we want to reach {qi} U T, where i ^ m, T — {sq, si, . . . , Sfc}, fc 0, 
and 1 < So < ■Si < ■ • ■ < ■Sfc. Let S* = {si, . . . , s^}; by the induction assumption, 

{qi\ U 5 is reachable. Then {qi\ U 5 {q™, 1} U S'-(,,o-i) {g., 1} U 

^.(sQ.i) ^ {qi} U {so} U S* = {gj U T . Thus {gj U T is also reachable. 

Next, we prove that the 2"~^ subsets of the form {q„n 1} U 5, where S is any 
subset of Ql \ {1}, are reachable. If m = 1, then {(7i,l} is the initial subset. 

Let S and T be be as above. Then {qi,l} U S '^'-^ {qiA} U S_(so-i) 

{gi, 1} U {2} U 5_(.„_2) {qi, 1} U {so} US= {gi, 1} U T. 

If m ^ 2, there are two cases. If 2 ^ 5, then start with {gi} U S, which has 

already been shown to be reachable. We then have {gi}U5 {g™, l}US'_i 
{g,„, 1} U 5. If 2 e 5, then start with {gi} US\ {2}. Now {gi} U 5 \ {2} ^ 
{qm, 1} U (5 \ {2})_i ^ {qrn, 1} U {2} U (S \ {2}) = {g™, 1} U S. 

Finally, we show that the (m — 1)2"~^ subsets of the form {g^, 1} U S, where 

i < m, and S C Qi^\ {1} are reachable. We have {g,„, 1} U 5 ^ — > {qi, 1} U 5. 

In summary, (m — 1)2" + 2"^^ different subsets are reachable. We now prove 
that all these subsets are pairwise distinguishable. 

For 1 ^ fc ^ n — 1, state k of accepts the word = c"^^^'^, and state n 
accepts the word Wn = d; moreover, each of these words Wh is accepted by only 
that one state h of Ql, and none of these words is accepted by state g^, if i m. 
Hence, if /i is in S* \ T or in T \ S', then S and T are distinguished by Wh. 

First, let 1 ^ i ^ j < m, and consider {qi}US and {gj}ur, where S,T (Z Q^, 
and 5 and T differ by state h. Then {g^} U 5 and {g^} U T are distinguished by 
Wh- Next, let 1 ^ i < J < m and take {gi} U S and {g^} U S, where S C Q^. 
First apply c; then we reach {g^} U i? and {gj} U R, where I ^ R. Then {g^} U R 
accepts a'"~-'c"~^, whereas {g^} U R rejects this word. 

Second, suppose S,T C Ql\{1} and S and T differ by state h; then {gm, 1}U 
S and {gm, 1} U T are distinguished by Wh- 

Third, consider {gJUS*, where S CQl and {g,„,l}UT, where T C Ql\{1} 
and i < m. Then c""^ is accepted by {qm, 1} U T but not by {g^} U S. 

Since all reachable sets are pairwise distinguishable, the bound is met. □ 
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Corollary 1. There exists a ternary star- free language L with quotient com- 
plexity n ^ 1, such that k{S*L) ~ 2"^^. 

Proof. If K = S*, the DFA Vk has one state, which is both initial and accepting. 
Now b is not needed in the proofs of reachability and distinguishability. □ 

A right (left) ideal g] is a language L satisfying L = LE* {L = E*L). 
If M = KU* {M ^ U*K), then M is the right (left) ideal generated by K. 
Corollary [T] shows that the bound 2"^^ on the quotient complexity of the left 
ideal generated by a regular language can also be met by a star-free language. 

If n = 1 in Theorem H then either KL = % and n{KL) = 1, or KL = KS* 
is the right ideal generated by K. In the second case, it is known [TB] that m is 
a tight bound for k{KE*), and that the language a™~^a* is a witness Since 
that witness is star-free, the general bound holds also for star-free languages. 

The case m ^ 2 and ri = 2 remains. For m = 77, = 2, the best bound for 
product of regular languages is 6, whereas it is 4 for star-free languages. This 
was verified with the GAP package Automata [H] by enumerating all products 
of 2-state aperiodic automata. 

There are only three types of inputs possible for a 2-state aperiodic DFA: 
the input that takes both states to state 1, the input that takes both states 
to state 2, and the identity input. If 1 is the accepting state, then subsets {1} 
and {1,2} are not distinguishable. Therefore a rejecting quotient of "Dk can 
appear with only three subsets of quotients oi Vl in the DFA of KL instead 
of 2^ — 4, and an accepting quotient, only with one subset instead of two. The 
complexity is maximized when there is only one accepting quotient of K. Hence 
k{KL) ^ (m — 1)3 -I- 1 = 3m — 2. If 2 is the accepting state, then {2} and {1, 2} 
are not distinguishable. Hence k{KL) ^ (m — 1)3 + 2 = 3m — 1 in this case. 

Theorem 3. There exist ternary star-free languages K and L with quotient 
complexities m ^ 2 and 2, respectively, such that k{KL) ~ 3m — 2. 

Proof. Let Vxia, b, c) be the DFA in the proof of Theorem [2] restricted to input 
alphabet {a, &, c}. Let Vl — ({1, 2}, {a, b, c}, S, 1, {1}), where 

^^(i, a) ~ i for 1 = 1,2, 

1 for i 1,2, 

5L{i,c) = 2 for i = 1,2. 

For i^m, subset {qi} is reached by a*~\ {gi}U{l}, by a"^~^b"^~\ and {(7i}U{2}, 
by a"^~^b'^~^c. Finally, {gm}U{l} is reached by a™"^. This gives 3m — 2 subsets. 

For i m, {qi} accepts no words from b*, {gi}U{l} accepts e, and {(7i}U{2} 
accepts b but not e. Hence subsets {qi}U S and {qi} UT with i,j ^ m, S,T G 
{0, {1}, {2}}, and S ^T, are distinguishable. Next, {q^} U S and {q^} U S with 
i < j < m are distinguished by ca™~^. Also, {q,} and {(?i}U{2} are distinguished 
from {qm} U {1} by e, and {qi} U {1} from {g„i} U {1} by c. Therefore all 3m — 2 
subsets are distinguishable. □ 

We do not know whether the bound 3m — 1 can be reached. However, we have 
verified with GAP that it cannot be reached \i m — 2. 
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5 Star 



The following DFA plays a key part in finding bounds on the quotient com- 
plexities of stars of star-free languages. Let n ^ 3, and 2?„ = Vn{a,b,c,d) = 
{Q, {a, 6, c, d\, S, 1, {n — 1}), where Q = {1, 2, . . . ,n} and 

S{i, a) = i + 1 for i = 1, . . . , n — 1, S{n, a) = n, 

S{i, fe) = i — 1 for i = 2, . . . , n, (5(1, b) = 1, 

S{i, c) ~ i — 1 for i = 2, . . . , n — 1, (5(1, c) = 1, d{n, c) = n, 

(5(i, d) ^ n for i = 1, . . . , ri. 

Since all the inputs perform non-decreasing transformations, Vn is aperiodic. 

In Fig. m if we ignore state and its outgoing transitions, and also the e 
transition, then the figure shows the automaton I?7(a, b, c, d). With state and 
the e transition it depicts the e-NFA of L* . 




Fig. 4. e-NFA N oi L* , k{L) = 7. Transitions under d (not shown) are all to 
state 7. 

We first study r'„(a, b), the restriction of 'D„(a, b, c, d) to the alphabet {a, b}. 

Lemma 1. If n ^ 3, and L is the star-free language accepted by P„(a, 6), then 
k{L*) = 2"-i +2"-3 _ I 

Proof. Consider the subsets of {0} UQ in the subset construction of the DFA for 
L*. Since can only appear in {0}, the remaining reachable subsets are subsets 
of Q. The empty subset cannot be reached because there is a transition from 
each state under every letter. Since state n — 1 cannot occur without state 1, 
we eliminate 2""^ subsets. Because state n — 1 always appears with state 1, and 
state n can only be reached from state n — 1 by a, the subset {n} first appears 
with state 2, and afterwards, always with a state from {1, . . . ,n — 1}; hence {n} 
cannot be reached. Also, 1 and n cannot appear together without n — 1, because 
n cannot be reached by &, and 1 cannot be reached by a without including 
n — 1. This eliminates another 2"~^ subsets. So 1 -|- 2"^^ + 1 + 2"^^ subsets are 
unreachable, and k{L*) 2" 4- 1 - (2"^^ _^ 2"-3 + 2) = 2"-^ + 2"-^ _ i. 

Now turn to the reachable subsets, and note that subsets {0} and {1} are 
reached by e and b, respectively. 
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First, let P = {5 C {2,...,n - 2} | 5 7^ 0}. All singleton sets {i} g P 
are reached by a'^^ from {!}. Now let S = {si, . . . , Sk}, T = {sq, si, . . . , Sfc}, 
where < fc, 1 < sq < si < • • • < < n — 1, and h = n — 1 ~ Sk', then 

S^{1}U S+h ^{1}US {1} U "'-V' {so} U S. Thus any T G P 

can be reached from a smaller S* G P, and so all subsets in P are reachable. 

Second, let Q = {{1} U 5 | 5 G P}; then S {1} U S, as above, and ah 
subsets in Q are reachable. 

Third, let R = {{1, n - 1} U S* | = or S* G P}. If S* = 0, then {1, n ~ 1} 
is reachable from {1} by a"~^. Now suppose 5 G P is not empty. If i G 5, 

then {i} ""^"^ {l,n~ 1} {i,n}. So S {n} U 5. Now, if = n - 2, 

then {71} US A - U S+i A - 1} U 5. If Sk < n - 2, then 

{n} U 5 4 {n} U 5+i A {1, n - 1} U 5. In either case, S ''^^^ {1, 7i - 1} U 5, and 
all 2"~^ subsets in R are reachable. 

Fourth, let S = {{n} U T | T G P U R}. We have shown that S {n} U S, 

n—2 n— 2 

if G P. Since also {l,n - 1} {l,n - we have - 1} U S* 

{1, 77, — 1, n} U S'. Hence all 2""^ — 1 subsets {77} U T in § are reachable. 

Altogether, 2"~^+2"~^ — 1 subsets are reachable. It remains to be shown that 
all the reachable subsets are pairwise distinguishable. State does not accept 
ab, while n — 1 accepts it. Each state i with 1 ^ i ^ n — 2 accepts a"~^^* and 
each of these words is accepted by only that one state, and 77, accepts b. So any 
two subsets S and T ^ S are distinguishable. □ 

Theorem 4. For n ^ 2 there exists a quaternary star-free language L with 
k{L) = n such that k,{L*) = 2"^^ + 2"^^. For n = 1, the tight upper bound is 2. 

Proof. For n = 1, there are only two languages, and E*, and both are star-free. 
We have k(0*) ~ 2, and k{{S*)*) = 1. For 7i = 2, there are two star-free unary 
languages, e and aa*, and the bound cannot be met if \IJ\ = 1. If = {a,b}, 
then b*aS* meets the bound 3. For ri = 3, we analyzed all 3-state aperiodic 
automata using GAP. The bound 6 is met by 7)3{a,b,c,d) defined above, and 
bounds 5 and 4 are met by V3{a,b,c) and V^^a^b), respectively. These bounds 
cannot be improved. 

We now turn to the general case. We will show that the following sets of 
states are reachable in the nondeterministic automaton TV (see Fig. U) from the 
initial state 0: the set {0}, all subsets of Q containing {l,n — 1}, and all non- 
empty subsets of (5 \ (77 — 1). By Lemma [U wc can reach all these subsets by 
words in {a, 6}*, except {n} and the subsets of Q \ (77- — 1) containing {1, 77}. 

We have {1, 77 — 1} A {2, n} A {1, n}; hence {1, n} is reachable. Now consider 
{n} U S, where S = {si, S2, • ■ • , Sfc} G P. Let h ~ n ~ 1 — Sk] then using a'' we 
move to {1,77} U S+h, and by c'' wc reach T ~ {1,*^} U S. Since {n} U S is 
reachable by Lemma [U T is also reachable. Thus we can reach all the subsets of 
Q \ (?^ — 1) containing {1, 77} by words in {a, 6, c}*. The only set missing now is 
{n}, and it is reached by d. 
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In Lemma [U we have already shown that any two subsets S,T C_ Q such 
that T ^ S are distinguishable by words in {a, b}*. □ 

Table [T] summarizes our results for the quotient complexity of i* in case L 
is star- free. For unary languages, see Section [71 The figures in boldface type are 
known to be tight upper bounds. For ri = 4, we analyzed all 4-state automata 
with non-decreasing input transformations. Automata Vi{a,b,c,d), 'D4^{a,h^c), 
and X'4(a, &) meet the bounds 12, 11, and 9, respectively. The bounds 11 and 9 
cannot be improved in the class of automata with non-decreasing input transfor- 
mations. For the rest, the bounds for jZ"! = 3 and jZ"! = 2 are met by X'„(a, 6, c), 
and 2?„(a, 6), respectively. 



Table 1. Quotient complexities for stars of star-free languages. 
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\E\ = A 
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12 


24 


48 


96 


192 




2n-l ^ 2"-2 



6 Reversal 

For regular binary languages, the tight bound for reversal [12] is 2". For star-free 
languages the bound 2" — 1 can be met, but with \E\ = n — \ letters. 

Theorem 5. For each n ^ 1 there exists a star- free language L with quotient 
complexity n such that k{L^) = 2" — 1. For n = 1, the bound is met if \S\ ^ 1, 
for n = 2, if \IJ\ ^ 2, and for n ^ 3, if \S\ ^ n — 1. 

Proof. For n = 1 and S = {a}, a* is a witness. For n = 2 and S = {a, b}, E*a is 
a witness. We have verified using GAP that all star-free languages L with n = 2 
satisfy k{L^) ^ 3; hence this bound cannot be increased. 

Now let n ^ 3, and let 2?„ = (Q, Z, S, 1, E), where Q = {1, 2, . . . , n}, Z = 
{a, &, C3, . . . , c„_i}, E = {i € Q \ i is even}, and 

d{i, a) = i + 1 for i ~ 1, . . . , n — 1, d{n, a) = n, 
(5(i, fe) = i - 1 for i = 2, . . . , n, (5(1, b) = 1, 

Cj) = i for i 7^ j, S{j, Cj) = j — 1 for j ~ 3, . . . ,n — 1. 

Since all the inputs perform non-decreasing transformations, 2?„ is aperiodic. 
Figure [5] shows the NFA J\f which is the reverse of DFA T>i . 

Assume initially that n is odd. Let 5* = {si, . . . , Sk} be a subset of Q, and 
letl^si<---<Sfc=^^- Then NFA A/" has the following properties: 
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b, C3,C4, C5, C(i 



C3, C4, C5, C6 



C3 , C5 , C6 



C3,C4,C5 



a, C3, C4, C5, C6 



C4 , C5 , C6 



C3,C4,C6 




Fig. 5. NFA A/" of 



n odd. 



PllfS^j^n— 1, jGS* and j — 1 ^ S, then input Cj deletes state j from S* 
without changing any of the other states. 

P2 If 3 ^ ^ n — 1, j ^ S, and j — 1 G 5, then input adds state j to 5 
without changing any of the other states. 

We now examine the sets of reachable states in N. The set O of all the odd 
states cannot be reached. For suppose that it is reached from some set S. If it is 
reached by a, then S must be a subset of E[j{n}. However, the successor under 
a of such a set S also contains n — 1 if it contains n. If we use b, then S must be 
a subset of i? U {1}. But then the successor of S also contains 2 if it contains 1. 
If we use Ci with i odd, then S must be a subset of O \ {i}, and S must also 
have i — 1. But then the successor of S also contains i — 1, which is even, if it 
contains i. If we use Ci with i even, then we also get i. 

If 71 = 3, there are no Ci inputs. Set {2} is initial, {1} can be reached by a 
and {3} by h. We can get by aa, {1, 2} and {2, 3} by ah and ha, respectively, 
and {1, 2, 3} by ahh. Set {1, 3} is unreachable. So assume 5. 

First, consider subsets S of M, the set of middle states; these are subsets of 
Q containing neither 1 nor n. If 2 e 5 start with E = {2, 4, . . . , n — 1}. By using 
inputs Ci, delete n — 1 or not, add ri — 2 or not, etc., until we reach 2, which 
cannot be removed by any Cj. If 2 5", then S'_(5j_2) has 2, is a subset of M, 
and so is reachable; then S is reached by h''^~^ from S'_(5j_2). 

Second, consider subsets S* of Q containing 1 but not n. If 2 G S, start with E 
and apply ah to reach {l}Ui?. Each state in E, except 2, is without a predecessor 
in {1} U E. Hence, by using inputs a, we can construct any such 5*. If 2 ^ S, 
start with E and apply a to reach O \ {n}, where O of all the odd states. By 
using inputs Ci, we can construct any such set S. 

Third, examine subsets S of Q containing n but not 1. If 2 £ S', start with 
E and apply h to reach E+i = {3, 5, . . . , n} = O \ {1}, and then apply a to get 
E U {n}. Construct any such set S using inputs q. If 2 ^ 5, then 5 is a subset 
of {3, . . . ,ri} containing n. Since the set 5-1 is a subset of M, it is reachable; 
then S is reached by h from S^i. 

Finally, consider subsets S containing both 1 and n. Apply haah to E to reach 
{1, n} U E. From this set we can reach any set containing {1, 2, n}. 

Now assume that 2^5. We now show that {i} U O is reachable for every 
even i > 2 in Q. Apply haa to E to reach {n — 1} U O. If i = n — 1. we are 
done; otherwise, delete n ~ 2 and n — 1 by c„_2 and c„_i in that order. Then 
insert n — 3 and n — 2 by c„_3 and c„_2 in that order. If z = n — 3, we are 
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done; otherwise, continue in this fashion. If we reach {3,4,5}, then i = 4, and 
the process stops. 

If ?7 = 5, then we can reach {1,3,4,5}. From {1,3,4,5} we can get {1,5}, 
{1, 4, 5}, and {1, 3, 4, 5}. We are missing only {1, 3, 5}, which is unreachable. 

If n 5^ 7; from {n — l}UO we can reach by Ci inputs all the subsets containing 
{l,n} but not {2}, except those subsets containing n — 2 without n — 1. From 
now on, we are interested only in the missing subsets, which are with {l,n}, 
without 2, and have n — 2 without n — 1. Then take {n — 3} U O. From here we 
can reach all subsets containing {l,n — 2,n} without {2,n — 1}, except those 
containing n — 4 without n — 3. If n = 7, then n — 4 = 3, and we are missing 
only {1, 3, 5, 7}, which is unreachable. 

Continuing in this fashion, we can reach all the subsets containing {1, n} but 
not 2, except O. Together with the case where 2 G 5, we have all the states 
containing {l,n}, except O. 

6, C3,C4,C5 C3,C4,C5 £4,05 C3,C5 C3,C4 a,C3,C4,C5 




Fig. 6. NFA Af of L^, n even. 

The case where n is even is similar. The NFA J\f is shown in Fig. |6] for n ~ 6. 
By an argument similar to that for n odd, O cannot be reached. 

Any subset oi M = Q\{l,n} can be reached as follows. If 2 £ 5, apply b to 
E to get 0\{1}, and then a to get to E\{7i}. Now any subset of M containing 2 
can be reached by inputs If 2 ^ S*, then any subset of M \ {2} can be reached 
from O \ {1} by inputs q. 

Second, consider subsets 5* of Q containing 1 but not n. If 2 e S", start with 
E and apply ba to reach E \ {n}. Then apply ab to get E \ {n} U {1}. Now any 
subset of {1} U M containing {1, 2} can be reached by inputs If 2 ^ S", start 
with E and apply baa to reach 0\{n — 1}. By using inputs c,;, we can construct 
any subset S of {1} U 71/ containing 1 and not 2, except the subsets that have 
{n — 3, n — 1} without n — 2. In case n = 4, we can reach {1, 2}, {1, 2, 3}, and 
{1}, but not {1, 3}. From now on, we are interested only in the missing subsets. 
As in the even case, we can get subsets containing {n — 3, n — 1} without n — 2 
by deleting n — 3 and n — 2, adding n — 4, and re-inserting ri — 3. Now we are 
unable to reach states having {n — 5, n — 3} without n~A. We verify that {i}UO 
is reachable for every even i with 4 ^ i ^ i — 2, and continue as in the odd case. 
We can keep moving this problem to the left, until we reach {3, 4, 5}. Then state 
4 cannot be removed because O is not reachable. 

Third, examine subsets 5 of Q containing n but not 1. If 2 G S', all such 
subsets are reachable by inputs q from E. If 2 ^ S, then 5 is a subset of 
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{3, . . . , n} containing n. Since 5_i is a subset of Af , it is reachable; then S is 
reached by b from S- 1 . 

Finally, consider subsets S containing both 1 and 71. If 2 S S, apply ab to 
reach {1} U E. From here we can reach any set containing {l,2,n} by inputs 
Ci. If 2 ^ S*, we reach O U {n} from E by a. From here we can reach any set 
containing {l,fT.} but not 2 by inputs q. 

We still need to verify that all the reachable subsets are pairwise distinguish- 
able. State i, and only state i, accepts a^~-^. Hence, if S',T C Q and S and T 
differ by state i, then they are distinguishable by a*~^. □ 

7 Unary Languages 

The case of unary languages is special. For regular unary languages, the tight 
bounds for each boolean operation K oL, product KL, star L* , and reversal 
are mn, mn, — 2n + 2, and n, respectively [18]. With the exception of the 
bound for reversal, these bounds cannot be met by star-free unary languages. 

Theorem 6. Let K and L be unary star-free languages with guotient complex- 
ities m and n, respectively. 

1. For each boolean operation o, k{K o L) ^ max{m,n) and the bound is tight. 

2. For product, k{KL) ^ m + n — I, and the bound is tight. 

3. For the star, the tight bound is 



4. For reversal, k{L^) = n. 

Proof. If a unary star-free language L is finite and k{L) ~ n, its longest word 
has length n — 2; if it is infinite, the longest word not in L has length n ~ 2. 

1. One verifies that k{K o L) ^ max(m,n). The witness languages are K = 
a™^^ and L = a"^^ for union and symmetric difference, K' = a™~^a* and 
L' = a'^~^a* for intersection, and K' and L' for difference, since K'\L' = K'nL' . 

2. One verifies that k{KL) ^ m -f n - 1, and K = a™~^a* and L ~ a"~^a* 
are witnesses. 

3. If L is infinite, then L D a"~^a*, and L* 3 a"~^a*; hence k(L*) < n. For 
n = 1,2,3,4,5, the bounds actually met in the infinite case are 1, 1, 3, 4, 5, 
respectively. If L is finite, it must contain a"~^, and if it has a, then k{L*) = 1. 
The tight bounds for finite unary star-free languages are 2, 2, 1, 2, 3, respectively. 
Hence the tight bounds for all unary star-free languages for the first five values of 
n are 2, 2, 3, 4, 5, and the witnesses are 0, e, a^a*, a^a*, and a'^a*, respectively. 

It was shown in [7] that for a finite unary language L, n{L*) ^ — 7n -t- 13 
for n ^ 5. For n > 6, this bound applies here, and a witness is a"~^ U a"~^. 

4. For unary languages, we have = L; hence k{L^) = k{L). □ 




1 



2, ifn = \- 

n, i/ 2 5C n ^ 5; 

In + 13, otherwise . 
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8 Conclusions 



Wc have shown that all the commonly used regular operations in the class of 
star-free languages meet the quotient complexity bounds of arbitrary regular 
languages. The only exceptions are in the product for n = 2, reversal, and 
operations on unary languages. 
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